skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Li, Sha"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Graphs and texts are two key modalities in data mining. In many cases, the data presents a mixture of the two modalities and the information is often complementary: in e-commerce data, the product-user graph and product descriptions capture different aspects of product features; in scientific literature, the citation graph, author metadata, and the paper content all contribute to modeling the paper impact. 
    more » « less
  2. The application of high-power, few-cycle, long-wave infrared (LWIR, 8–20 µm) pulses in strong-field physics is largely unexplored due to the lack of suitable sources. However, the generation of intense pulses with >6 µm wavelength range is becoming increasingly feasible with the recent advances in high-power ultrashort lasers in the middle-infrared range that can serve as a pump for optical parametric amplifiers (OPA). Here we experimentally demonstrate the feasibility of this approach by building an OPA pumped at 2.4 µm that generates 93 µJ pulses at 9.5 µm, 1 kHz repetition rate with sub-two-cycle pulse duration, 1.6 GW peak power, and excellent beam quality. The results open a wide range of applications in attosecond physics (especially for studies of condensed phase samples), remote sensing, and biophotonics. 
    more » « less
  3. Event schemas are a form of world knowledge about the typical progression of events. Recent methods for event schema induction use information extraction systems to construct a large number of event graph instances from documents, and then learn to generalize the schema from such instances. In contrast, we propose to treat event schemas as a form of commonsense knowledge that can be derived from large language models (LLMs). This new paradigm greatly simplifies the schema induction process and allows us to handle both hierarchical relations and temporal relations between events in a straightforward way. Since event schemas have complex graph structures, we design an incremental prompting and verification method INCPROMPT to break down the construction of a complex event graph into three stages: event skeleton construction, event expansion, and event-event relation verification. Compared to directly using LLMs to generate a linearized graph, INCPROMPT can generate large and complex schemas with 7.2% F1 improvement in temporal relations and 31.0% F1 improvement in hierarchical relations. In addition, compared to the previous state-of-the-art closed-domain schema induction model, human assessors were able to cover ∼10% more events when translating the schemas into coherent stories and rated our schemas 1.3 points higher (on a 5-point scale) in terms of readability. 
    more » « less
  4. Abstract Phenotypic variation among species is a product of evolutionary changes to developmental programs1,2. However, how these changes generate novel morphological traits remains largely unclear. Here we studied the genomic and developmental basis of the mammalian gliding membrane, or patagium—an adaptative trait that has repeatedly evolved in different lineages, including in closely related marsupial species. Through comparative genomic analysis of 15 marsupial genomes, both from gliding and non-gliding species, we find that theEmx2locus experienced lineage-specific patterns of acceleratedcis-regulatory evolution in gliding species. By combining epigenomics, transcriptomics and in-pouch marsupial transgenics, we show thatEmx2is a critical upstream regulator of patagium development. Moreover, we identify differentcis-regulatory elements that may be responsible for driving increasedEmx2expression levels in gliding species. Lastly, using mouse functional experiments, we find evidence thatEmx2expression patterns in gliders may have been modified from a pre-existing program found in all mammals. Together, our results suggest that patagia repeatedly originated through a process of convergent genomic evolution, whereby regulation ofEmx2was altered by distinctcis-regulatory elements in independently evolved species. Thus, different regulatory elements targeting the same key developmental gene may constitute an effective strategy by which natural selection has harnessed regulatory evolution in marsupial genomes to generate phenotypic novelty. 
    more » « less
  5. Abstract Studies of laser-driven strong field processes subjected to a (quasi-)static field have been mainly confined to theory. Here we provide an experimental realization by introducing a bichromatic approach for high harmonic generation (HHG) in a dielectric that combines an intense 70 femtosecond duration mid-infrared driving field with a weak 2 picosecond period terahertz (THz) dressing field. We address the physics underlying the THz field induced static symmetry breaking and its consequences on the efficient production/suppression of even-/odd-order harmonics, and demonstrate the ability to probe the HHG dynamics via the modulation of the harmonic distribution. Moreover, we report a delay-dependent even-order harmonic frequency shift that is proportional to the time derivative of the THz field. This suggests a limitation of the static symmetry breaking interpretation and implies that the resultant attosecond bursts are aperiodic, thus providing a frequency domain probe of attosecond transients while opening opportunities in precise attosecond pulse shaping. 
    more » « less
  6. Schema induction builds a graph representation explaining how events unfold in a scenario. Existing approaches have been based on information retrieval (IR) and information extraction (IE), often with limited human curation. We demonstrate a human-in-the-loop schema induction system powered by GPT-3. We first describe the different modules of our system, including prompting to generate schematic elements, manual edit of those elements, and conversion of those into a schema graph. By qualitatively comparing our system to previous ones, we show that our system not only transfers to new domains more easily than previous approaches, but also reduces efforts of human curation thanks to our interactive interface. 
    more » « less